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The two-point correlation function has been the standard statistic for quantifying 
how galaxies are clustered. The statistic uses the positions of galaxies, but not their 
2^ ! properties. Clustering as a function of galaxy property, be it type, luminosity, color, 

etc., is usually studied by analysing a subset of the full population, the galaxies in 
the subset chosen because they have a similar range of properties. We explore an 
alternative technique — marked correlations — in which one weights galaxies by some 
• property or 'mark' when measuring clustering statistics. Marked correlations are par- 

I ticularly well-suited to quantifying how the properties of galaxies correlate with their 

, environment. Therefore, measurements of marked statistics, with luminosity, stellar 

mass, color, star-formation rate, etc. as the mark, permit sensitive tests of galaxy 
formation models. We make measurements of such marked statistics in semi-analytic 
■ galaxy formation models to illustrate their utility. These measurements show that 

' close pairs of galaxies are expected to be red, to have larger stellar masses, and to 

r~| . have smaller star formation rates. We also show that the simplest unbiased estimator 

Q^' of the particular marked statistic we use extensively is very simple to measure — it 

does not require construction of a random catalog — and provide an estimate of its 
variance. Large wide-field surveys of the sky are revolutionizing our view of galaxies 
^ and how they evolve. Our results indicate that application of marked statistics to this 

^ I high quantity of high-quality data will provide a wealth of information about galaxy 

formation. 
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1 INTRODUCTION 

It has been thirty five years since Totsuji & Kihara (1969) 
published their measurement of the galaxy correlation func- 
tion. Since then, measurements in ever larger catalogs have 
shown their initial estimate of ^(r) oc r~^ ** on scales smaller 
than about ten Megaparsecs was accurate (e.g., Maddox et 
al. 1990; Percival et al. 2001; Hamilton & Tegmark 2002; 
Connolly et al. 2002). Recent work in the 2dFGRS (CoUess 
et al. 2001) and SDSS (Abazajian et al. 2005) surveys has 
shown definitively that clustering depends on galaxy type, 
color and luminosity (Norberg et al. 2002; Zehavi et al. 2002, 
2005). In what follows, we will speak of any combination of 
properties which distinguish galaxies from one another as 
marks. 

Marks can take discrete or continuous values, and they 
may be scalars or vectors. The traditional morphological 
type of a galaxy is a discrete scalar mark. Luminosity, color. 
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X-ray hardness, AGN activity and star formation rate are 
all examples of continuous scalar marks. A typical neural 
network classifies morphological types as a list of probabili- 
ties p — {pi, ■ ■ ■ ,Pn), where pi is the probability a galaxy is 
of type Ti (e.g., Storrie-Lombardi et al. 1992; Lahav et al. 
1996). If one classified galaxies using this probability vec- 
tor (rather than simply using the type Ti which had the 
largest pi), then morphological type is a vector mark. Prin- 
cipal component analyses of galaxy spectra output a vector 
c = (ci, • • ■ , c„), where d denotes the contribution of eigen- 
spectrum i to the object's spectrum (e.g. Connolly et al. 
1995). The spectral classification c can, therefore, be used 
as a vector mark. 

In most analyses to date (e.g., Hamilton 1988; Mo, 
Borner & Zhou 1989; Willmer, da Costa & Pellegrini 1998; 
Benoist et al. 1999; Norberg et al. 2002; Zehavi et al. 2005), 
the parent galaxy catalog is cut into subsamples based on 
the mark (most often morphological type or luminosity), and 
the clustering in the subsample is studied by treating each 
galaxy in it equally. That is to say, the mark is used to define 
the subsample, but is not considered further. This is equiva- 
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lent to relabeling all marks to be 'zeros' or 'ones', depending 
on whether or not the value of the mark crossed a threshold 
value, and then computing the correlation function of the 
galaxies marked as 'ones'. There are two reasons why this 
procedure is not ideal. First, the choice of critical thresh- 
old is somewhat arbitrary, particularly in the case of marks 
like luminosity and color which take a continuous range of 
values. Second, throwing away the actual value of the mark 
represents a loss of information. Therefore, one might well 
ask: Why not weight each galaxy by its actual mark, rather 
than relabelling to ones and zeros? 

With the advent of large wide-field space- and ground- 
based surveys of the sky, many of the marks mentioned 
above can now be measured with sufficient precision that 
contamination by measurement error is not a serious con- 
cern. Indeed, many of these marks have been, or can be, 
reliably measured in a number of datasets now available. 
For example, the luminosities in the 2MASS survey are ex- 
pected to be indicators of total stellar mass; the ROSAT 
all sky survey can be used to estimate the hardness of the 
spectra of X-ray sources; the GALEX mission will provide 
estimates of star formation rates out to redshifts of order 
unity. The time is ripe to take advantage of the additional 
information provided by weighting each galaxy by its mark. 

Marked correlations in the SSRS2, IRAS 1.2 Jy and 
PSCz surveys, using luminosity and/or type as mark were 
measured by Beisbart & Kerscher (2000). The marked cor- 
relations showed evidence that the luminosities of close pairs 
of galaxies were larger than the mean on separations larger 
than about 10/i~^Mpc. Beisbart & Kerscher did not com- 
pare their measurements with specific model predictions, so 
their results have had less impact on galaxy formation mod- 
els than they might otherwise have had. 

Semi-analytic galaxy formation models (White & Rees 
1979; White & Frenk 1991; KauflFmann et al. 1999; 
SomerviUe & Primack 1999; Cole et al. 2000; Springel et 
al. 2005) provide the basis for our current understanding of 
how and why different galaxy types are differently biased 
tracers of the dark matter field. They also provide a useful 
framework for seeing if simple models are consistent with 
the observed correlations between luminosity and velocity 
dispersion or circular velocity (Faber & Jackson 1976; TuUy 
& Fisher 1977), morphology and density (Dressier 1980), 
morphology or density and star formation rate (Lewis et al. 
2002; Gomez et al. 2003; Kaufi'mann et al. 2004), etc. 

Although the predictions of these models depend on a 
number of free parameters, and they do not always provide 
perfect descriptions of the data, they have proven to be use- 
ful for gaining insight into how changes in the physics of 
galaxy formation affect the properties of the galaxy popula- 
tion. These models output a number of marks which are po- 
tentially observable. These include luminosity, size, velocity 
dispersion, morphology, star-formation rate, local density, 
and so on, and how these marks evolve. The main goal of 
the present paper is to measure various marked correlation 
functions in these models, so as to illustrate how marked 
statistics can provide direct physical insight into the pro- 
cesses of galaxy formation. 

Section |5| introduces the family of pairwise marked cor- 
relation functions, and defines the particular generalization 
of the usual correlation function we will use. Sectionj^shows 
measurements of various marked correlation functions made 



in semi-analytic models of galaxy formation. Section 2] dis- 
cusses the efi'ects of rescaling marks prior to measuring 
marked statistics. Such rescaling is necessary if one wishes 
to compare galaxy formation models with observations, if, 
as is often the case, the distribution of marks in the models 
is not the same as in the data. A final section summarizes 
our findings, and discusses some extensions. An Appendix 
provides a brief discussion of the bias and variance of our 
estimators of marked statistics (our estimators are unbiased, 
but they are not minimum variance) . In essence, the estima- 
tor we use throughout this paper is particularly straightfor- 
ward to implement because it does not require construction 
of a random catalog; in this respect, marked statistics are 
substantially easier to estimate than the usual unweighted 
correlation functions. 



2 MARKED STATISTICS 

Marked correlations measure the clustering of marks. Since 
the positions at which the marks are measured may them- 
selves be clustered, marked statistics are defined in a way 
which accounts for this. This is the subject of Sections 12.11 
and l2.4l Marked statistics are usually studied to see if marks 
depend on environment. Section 12.61 shows that marked 
statistics have another very useful application which has 
not been emphasized previously. Namely, different marks fre- 
quently correlate with each other: marked statistics can be 
used to quantify if the correlations between marks depend 
on environment. 



2.1 Definitions 

For example, let p denote the mean density of particles, and 
let m denote the mean mark, averaged over all particles. Now 
consider a particle with mark larger than this mean value. 
Are the particles neighbouring it also likely to have larger 
marks? One way to quantify how likely this is is to compute 
the ratio of the mean mark to m of pairs of particles as 
a function of pair separation. The typical number of pairs 
at separation r is p^W + (,{r)], where ^ is the two point 
correlation function. Therefore, the mean mark is 



Mi{r) = 



^[m(a:) -I- m{y)]T{\x - y\ - r) 

2m ^T(|a; — y|— r) 
l][m(a:) + ■m{y)]I{\x - y\ - r) 
2mp2[l + 5(r)] 



(1) 



where T{x) — unless a; = 0, and the sum is over all galaxy 
pairs. We have divided by m, so 7Vfi(r) — 1 for all r if 
there are no correlations between marks. Analogously, the 
nth-order mark is defined by 



Mnir) 



j:[m{x)+miy)ri{\x-y\-r) 
{2m)" p^[l + £,{r)] 



(2) 



In what follows, we will concentrate on the mean square 
mark 



M2{r) 



Y.[m'\x) + n?{y) + 2m(x)m{y)]I(\x - y| - r) 



4m2 p2[l + ?(r)] 

If all the marks are the same, then A^2 = 1- If the marks are 
independent, with mean mi = m and mean square m2, then 
M2 = (m2 -|-mf)/2m^ = (m^lm^ 4- 1)/2. To see why this is 
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sensible, compute the variance: Var(r) = M2{T)—Mi{r)'^ = 
(m^lfr? — 1)/2. This shows that Var equals half the variance 
of the individual marks, as it should (recall it is the variance 
of one half times the sum of two independent marks). 

Two types of terms contribute to 7Vl2- The term which 
involves a product of two marks is proportional to 

= E"i(a:)m(j/)J(|x-j/|-r) ^ 1 + VK(r) 

where we have defined W(r) for the following reason. The 
only difference between the sums in the numerator and de- 
nominator is that, in the numerator, the ith particle con- 
tributes a weight rrii/ffi, whereas in the denominator, the 
weight is unity for all particles. Since the denominator is 
one plus the usual unweighted correlation function, the nu- 
merator can be thought of as being one plus a weighted 
correlation function. In effect, the denominator divides-out 
the contribution to the weighted correlation function which 
comes from the spatial distribution of the points, leaving 
only the contribution from the fluctuations of the marks. 

2.2 Estimators and the weighted correlation 
function 

There are a number of reasons why this ratio, M(r), is 
the measure of marked correlations on which we will con- 
centrate. First, fast estimators of unweighted correlations 
(which account for edge effecs, etc.) have been available for 
some time (e.g., Peebles 1980). If we write the usual esti- 
mator for the unweighted correlation function as DD/RR, 
then the weighted correlation function is WW/RR, and the 
marked statistic above is simply WW/DD. This has two 
important implications. First, allowing for weights requires 
only a simple modification of algorithms which estimate un- 
weighted correlations (e.g. Moore et al. 2000). Second, be- 
cause edge effects appear in both the numerator and denom- 
inator, this marked correlation ratio is less sensitive to edge 
effects than is, e.g., the DD/RR estimator of the unweighted 
correlation function (e.g. Beisbart & Kerscher 2000). Per- 
haps more importantly, note that the RR factor cancels out, 
so that the marked statistic can be estimated without con- 
structing a random catalog; this significantly reduces the 
computational burden for estimating the statistic. Third, er- 
rors on the measurement are readily estimated, as described 
in Appendix^ (It is relatively common to approximate the 
errors by remaking the measurement after randomizing the 
marks, and repeating a large number of times. We show 
that this is a reasonable approximation on the small scales 
where we find the most interesting results, but leads to an 
underestimate of the true errors on larger scales.) And fi- 
nally, it is by thinking of marked correlations as this ratio 
of weighted to unweighted correlations that one is able to 
construct theoretical models of marked correlations (Sheth, 
Abbas & Skibba 2004; Sheth 2005). 

2.3 Measurement errors 

If the mark is not perfectly measured, so the observed mark 
Oi of object i is Oi = rrii + e^, where is the error in the 
measurement, then the measured marked correlation func- 
tion will be Mo{r) = {{mi + ei){mj+ej)\r) , where the nota- 
tion denotes the average over all pairs i and j separated by 



r. Thus, Moir) — Mm{r) + Me{r) plus a term which allows 
for the fact that the mark of one object is correlated with 
the error made in measuring the marks of the other objects. 
If there is no such correlation, and if the mean measurement 
error for objects does not depend on position, and if the 
errors themselves are uncorrelated, then Mo{r) — Mm{r): 
in this case, the measured marked correlation will be an 
unbiased representation of the true correlation. Of course, 
the precision with which this marked correlation function 
can be determined will depend on the amplitudes of the as. 
Appendix^ provides a more detailed discussion. 

2.4 Normalization 

The expressions above follow the conventions in the statis- 
tical literature (e.g. Stoyan 1984; Stoyan & Stoyan 1994), 
and so normalize the marks by the mean mark. There is no 
apriori reason for this choice. For instance, we could have 
chosen to normalize by the rms value instead. In models 
where large scale correlations are expected to be small, our 
convention makes the large scale limit M(r) 1, which 
is intuitively easy to understand. Normalizing by the square 
root of the mean squared mark instead makes the amplitude 
of the small-scale signal easier to understand, since this con- 
vention would have M{r ^ 0) 1, but then the large scale 
value would depend on the ratio of the mean and rms val- 
ues of the marks. This convention might be useful if one 
is interested in studying marked statistics in a density field 
which was obtained by smoothing the mark-weighted point 
distribution. 

Moreover, the expressions above implicitly assume that 
m is not zero. If marks are not positive definite, then rn 7^ 
is not guaranteed, and one may well have to normalize by the 
rms value. To avoid this complication, in what follows, we 
will only consider marks which are positive definite. Analysis 
in Section|31suggests that monotonic rescalings of the marks 
will not seriously compromise their use. Thus, for instance, 
when we use galaxy colors (which are proportional to the log 
of the ratio of the luminosity in two different wavebands) as 
marks, we use the ratio of the luminosities, rather than the 
log of this ratio. 

2.5 Cross-correlations 

So far, we have implicitly assumed that the mark used for 
both particles was the same. There is no reason why we 
could not have used different marks for the two particles of 
each pair. For example, we could use effective radius and 
local density as two marks to study if galaxies in denser 
regions are smaller or larger than average. Marked cross- 
correlations could also be used to study if star formation 
rates in galaxies are affected by AGN activity in their neigh- 
bors. To estimate cross correlations of this type, the expres- 
sions given previously remain true, with the replacements 
m{x)m{y) / fh^ — » ma(a;)mb(y)/mami,, where the subscripts 
a and b denote the two different types of marks. 

2.6 Correlations between marks 

Many galaxy observables correlate with each other: e.g., the 
velocity dispersion of a galaxy correlates with its size, lumi- 
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nosity, and color. Marked statistics provide a simple way to 
see if such correlations depend on environment. Note that 
this differs from the previous subsection which described 
marked cross-correlations between a mark of one galaxy 
and another mark of its neighbour; here we are interested 
in quantifying how the correlation between two (or more) 
marks of the same galaxy depends on the surrounding envi- 
ronment. 

Suppose each galaxy has two weights, m and w, and 
that a plot of the two versus one another (each galaxy is a 
point in this two-dimensional plot) shows a correlation, but 
there is scatter around the mean correlation. Let w — f{m) 
denote this mean correlation. A question which often arises 
is: Does the scatter correlate with environment? If we write 
Wi = fi + a, where fi = f{mi), then 

in + e^] [/, + e,] I 



' w w \ / \ f f 

P 

where the notation is intended to show that the averages are 
over all pairs with separation r. Evidently, is a (suitably 
weighted) combination of Mf, Me, and the cross-correlation 
between / and e. If the amount by which galaxy i scatters 
from the mean w — m relation is independent of the marks 
rrij of the other galaxies, then {eifj\r) = {ei\r) {fj\r} and 
{eiCjlr} = {ei\r}{ej\r} . If the scatter is independent of envi- 
ronment, then (ei|r) — for all r. This shows that, if there 
is no correlation in the scatter, then Mu,(r) —> Mf(^^-){r). 
Hence, one can determine if the correlation between marks 
depends on scale by comparing Mw{r) with A/y(„)(r). If 
Mmir) depends on scale, then = Mm only if /(m) oc m. 
If f{m) is a more complicated function of m, then dif- 
fers from Mm by a scale dependent factor even if /(m) is 
independent of r. 



2.7 An illustrative example 

Figure shows an example of what marked correlation 
functions measure. The underlying point process was the 
halo distribution in the Hubble Volume simulation (Evrard 
et al. 2002) of a flat ACDM cosmology {ilo = 0.3, h = 
0.7, erg = 0.9). The dashed lines in the top panels show the 
unweighted correlation function of halos with masses in the 
range 1.12x lO^M©//!. <M< 1.57x IO^^Mq/Zi (left, about 
2 X lO'^ halos) and greater than 6.72 x IO^Mq/Zi (right, 
2.5 X 10* halos). The solid lines in the top panels show the 
result of weighting each halo by the ratio of its mass to the 
mean halo mass in the subsample when computing the cor- 
relation function. Comparison of the two panels shows that 
on large scales, the more massive halos are more strongly 
clustered. The decrease at small r is a consequence of vol- 
ume exclusion: halos do not overlap. The more massive halos 
occupy larger volumes, so volume exclusion matters (i.e., the 
correlation functions turn over) at larger scales in the panel 
on the right (this effect was discussed by Mo & White 1996, 
and quantified by Sheth & Lemson 1999). 

Let C(r) and W{r) denote the dashed and solid curves 
in the top panels. The symbols with solid lines drawn 
through them in the bottom panels show the marked corre- 
lations M(r) = [14- iy(r)]/[l -f- C(r)], where the mark is the 
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Figure 1. Correlation functions of massive halos in numerical 
simulations of a ACDM cosmology. Top panels show the num- 
ber (clashed) and mass (solid) weighted correlation functions, and 
panels on the bottom show marked correlations (crosses). The 
dotted curves in the bottom panels show the mean plus and minus 
the standard deviation of measurements made after randomizing 
the marks. 



halo mass. There is no trend in the panel on the left, because 
the range of halo masses in it is small. As a result, almost 
all halos have the same mass, and hence the same mark. In 
contrast, the symbols in the panel on the right show a hump 
at 10h~ Mpc, indicating that the most massive halos tend 
to have separations of about that scale. 

One of the virtues of marked correlations is that there 
is a simple way to assess the statistical significance of such 
a feature. The approximately horizontal curves bounded by 
dotted lines in the bottom panels show the mean and the 
standard deviation around the mean when the same mea- 
surement is made after randomizing the marks. (The curves 
show results averaged over one hundred random realiza- 
tions.) Comparison of the symbols with these randomized 
curves shows that the hump at 10/i~^Mpc in the panel on 
the right is statistically significant. (Appendix A provides 
a more careful discussion of the uncertainty on the mea- 
sured statistic, and shows that this randomization procedure 
slightly mis-estimates the true scatter, since it assumes there 
are no correlations between the marks. However, since the 
correlations between marks we find are not very different 
from unity, this estimate based on randomization is not far 
from the true answer.) 

Although Af (r) ~ 1 at large scales, it drops below unity 
on scales smaller than 5ft~^Mpc. This indicates that the ha- 
los which contribute pairs at small separations are the least 
massive subset of the population, as one would expect from 
volume exclusion. Notice that the marked correlations illus- 
trated this effect without having to separate the catalog up 
into small bins in mass. Therefore, they allow a measurement 
of what is in this case a 'mass segregation' effect which is of 
greater statistical significance than it would be in any one 
of the smaller catalogs. 
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Figure 2. Marked correlations in tlie semi-analytic models with B, I and /C-band luminosity at 2 = 0.20 as the mark. In each panel, 
jagged solid line shows the marked correlation, and error bars show the estimated uncertainty on the measurement derived in Appendix A. 
For comparison, horizontal line shows the mean obtained by averaging over one hundred realizations of the same measurement after 
randomizing the marks, and dotted lines show the standard deviation around this mean. When luminosity is the mark, the marked 
correlation is a strong function of wavelength. 



3 PREDICTIONS OF SEMI-ANALYTIC 
GALAXY FORMATION MODELS 

It is straightforward to make similar measurements, us- 
ing various marks, in semi-analytic galaxy catalogs. In 
what follows, we will use the galaxy formation models of 
Kauffmann et al. (1999) to illustrate the sort of informa- 
tion that marked correlations provide. This is not because 
we feel other models are significantly worse, but because 
these model galaxy catalogs are available to the public 
(www.mpa-garching.mpg.de/Virgo). The models populate 
the dark matter distribution of the GIF simulation with 
galaxies: the background cosmology is a fiat ACDM universe 
with (57, /i, ag) = (0.3,0.7,0.9), and the simulation box is a 
cubical comoving volume 141/i~^Mpc on a side. All distances 
we show below are comoving. 

We show measurements of marked statistics in these 
semi-analytic galaxy catalogs for subsamples of galaxies 
which contain more than 2 x lQ^'^h~^MQ in stars. Such 
measurements are best thought of as being related to mea- 
surements in volume-limited, rather than magnitude-limited 
galaxy catalogs. At z = 0.2 (this redshift was chosen to ap- 
proximately match the median redshifts of the Two Degree 
Field Galaxy Redshift Survey and the Sloan Digital Sky 
Survey; c.f. CoUess et al. 2001 and York et al. 2000) the 
semi-analytic galaxy catalog contains 14,665 objects. This 
number is about thirty percent smaller at z = 1.05. 



3.1 Luminosity 

Figure 121 shows marked correlations in which luminosity was 
used as the mark. The various panels show measurements 



made in the semi-analytic models at 2; = 0.2, at various 
wavelengths. In each panel, the jagged lines show the mea- 
surement, and error bars are estimated following the anal- 
ysis in Appendix A. The figure shows clear evidence that 
the shape of the marked correlation function depends on 
wavelength. In all cases, Mi,uin{r) falls gently at separations 
larger than about 3 Mpc, but the small scale behaviour is 
very different. It is interesting that this 3 Mpc scale is simi- 
lar to that at which ^(r) shows a feature: this is the scale on 
which the statistic becomes dominated by pairs in separate 
halos. 

Comparison of the curves in the different panels shows 
that while all bands are reasonably similar on larger scales, 
the different bands are quite different from each other on 
smaller scales. In contrast to the redder band-passes, the 
marked correlation decreases with scale for the B— band. 
Evidently, the blue-band luminosities of galaxies with near 
neighbours (i.e., closer than a Mpc or so) are smaller than 
they are on average. Since the _B— band luminosity is more 
sensitive to recent star formation than the /— or Tf— bands, 
it is interesting to study what happens when star-formation 
rate is used as the mark. 

We will make two additional points before we do so. 
First, although we have not shown this, we have found that 
if we restrict attention to any one band, then the amplitude 
of M{r), especially on small scales, depends on the range of 
luminosities in the sample. In general, increasing the range 
of luminosities makes the small-scale dependence of M(r) 
on r more dramatic. The reason for this was hinted at in 
Section 12.41 if close pairs tend to have larger marks, then 
the amplitude of M{r) depends on the range of luminosities 
in the sample. 
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Figure 3. Comparison of various estimates of the uncertainty on 
the measurement; the three panels show the same set of marks 
as in Figure |^ Larger error bars show the full estimate from 
Appendix A; smaller error bars show the result of ignoring the 
contribution from equation lAlll . The standard deviation of the 
marked statistic measured in many (one hundred) realizations of 
the point distribution with randomized marks is shown by the 
dotted curves. 

Second, consider the horizontal solid line at M ~ 1 on 
all scales in each panel. This line shows the mean value of the 
statistic averaged over one hundred realizations of the cata- 
log after randomizing the marks. The dotted lines show the 
standard deviation around this mean, and this standard de- 
viation is sometimes used as an estimate of the error on the 
measurement of M(r). Notice that the difference between 
the solid and dotted lines is similar to the size of the error 
bars on the measurement, indicating that this randomized- 
marks procedure of estimating errors does not grossly mis- 
estimate the true errors, at least on small scales. 

Figure |3 provides a more direct comparison. Dotted 
curves show the error estimate from randomizing the marks. 
The two sets of error bars show the two contributions to 
the error estimate from Appendix A; the smaller estimate 
comes from ignoring the contribution from equation (lAlH . 
On small scales, the dotted lines and full error bars are in 
good agreement, except in the B-band. In B-, M{r) itself is 
smaller than unity, so the error estimate from Appendix A 
is also smaller than the one obtained from randomizing the 
marks. On larger scales, the estimate from Appendix A be- 
comes significantly larger than the one from randomizing the 
marks. Since M(r) itself is unity on these scales, this differ- 
ence strongly suggests that our estimator, WW/ DD, can be 
improved — while unbiased, is not minimum variance. This 
hypothesis is supported by the fact that, in this regime, the 
error estimate is dominated by the term in equation l|All|l . 

3.2 Colors and star-formation 

Figure |4] shows the marked correlations of the same set of 
objects when color and star formation rate are used as marks 
(curves with error bars) . As in the previous figure, the solid 
and dotted curves show the mean (averaged over one hun- 
dred realizations) and the standard deviation around the 
mean when the same measurement is made after random- 
izing the marks. The error estimates indicate that, at sep- 
arations smaller than a Mpc or so, star formation rates are 
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Figure 4. Marked correlations at z = 0.2 and z = 1.05 in the GIF 
semi-analytic galaxy formation models (Kauffmann et al. 1999). 
Top and bottom panels show results when the mark is restframe 
V—I color and star formation rate, respectively. Curves with error 
bars show the marked correlations with associated uncertainties. 
Horizontal lines show the mean of one hundred realizations of the 
same measurement after randomizing the marks, and dotted lines 
show the standard deviation around this mean. 



substantially lower than the mean, whereas the V — I colors 
of close pairs are larger (i.e., redder) than the mean. 

Comparison of the panels on the left with those on the 
right shows how the marked correlations evolve. Notice that 
the two marked correlations depend very differently on scale, 
suggesting that color and star-formation rate are biased very 
differently relative to the underlying dark matter distribu- 
tion. On the other hand, the scale on which Mv-i{r) sud- 
denly increases is similar to the scale on which MsFH,(r) 
decreases, suggesting that similar physics gives rise to both 
effects. 

Why does this happen? Many close pairs come from 
galaxies in groups and clusters. In the models, such galaxies 
started forming their stars at higher redshifts than average. 
By the present time they are likely to have turned all the gas 
available to them into stars. Therefore, at low redshifts (e.g, 
z < 1), the star formation rates in cluster galaxies are low, 
and their colors are red. Figure |2 shows that the marked 
correlation functions are sensitive to these effects. Since the 
top panel is weighted by V—I, and redder colors mean larger 
values oi V — I, the marked correlations in the top panels 
show a dramatic increase on scales smaller than the typical 
virial radius of a large cluster. Since star formation within 
clusters has essentially switched off, the trend in the bottom 
is the opposite. Thus, in the models, it is no accident that 
the scale on which My-i suddenly increases is similar to the 
scale on which Msfr decreases. 

Notice that the scale at which the two curves suddenly 
change is smaller at higher redshifts, and that the shift in 
scale is easily detected in both cases. This is not unexpected 
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Figure 5. Marked correlations in which stellar mass is the mark 
(solid lines with error bars). For comparison, crosses in the panel 
on the left show what happens when /f— band luminosity is the 
mark. 

in models where star-formation is lowest in the most massive 
halos. Comparison with similar measurements in the data 
will show if these predictions of the models are correct. If 
not, the gastrophysics which has been used to model galaxy 
formation should be refined. For instance, GALEX observes 
galaxies out to redshifts of order unity, so it is well suited to 
performing this test. 

3.3 Stellar mass and the mass-to-light ratio 

We were prompted to study the star-formation rate because 
of the differences between the various panels of Figure|21 The 
discussion above suggests that the drop on scales smaller 
than an Mpc in the B-band is consistent with Figure^Jwhich 
showed a decrease in the star formation rate on small scales. 
Luminosities in the redder bands are less sensitive to recent 
star formation, so they are expected to be better tracers of 
the total stellar mass. 

To illustrate, the curves with error bars in Figure|S]show 
marked correlations in the semi-analytic models ai z = 0.2 
and z — 1.05 when the stellar mass is used as mark. Com- 
parison of the two panels shows that M{r) evolves: at both 
redshifts, close pairs tend to more massive, but the scale on 
which this is a significant effect is smaller at high redshift. 
Moreover, the amplitude of M{r) also evolves: stellar masses 
of close pairs at high redshift were not as different from the 
mean as they are today. The shift in amplitude and in scale 
means that similar measurements in data should provide 
sharp constraints on the models. 

The crosses in the panel on the left show the result of 
using the if— band luminosity as the mark (for clarity, we 
have not shown the associated error estimates, since these 
are shown in Figure |5J . Notice that the same bumps and 
wiggles are present for both Mk and Mstars- This sug- 
gests that the two marks, if-band luminosity and stellar 
meiss, do indeed refiect similar physics. The discussion in 
Section 12.61 indicates that comparison of these two marked 
correlations can be used to study if the correlation between 
stellar mass and Lk is the same in all environments. In par- 
ticular, if Lk is linearly proportional to stellar mass, then 
the two marked correlations will be the same only if this 
correlation is independent of environment. (Specifically, if 
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Figure 6. Correlation between K-hand luminosity and stellar 
mass. Since this correlation is linear, the two marked correlations 
will be the same only if there are no environmental effects. 

Li — aMi+ei with (L) — a{M), then the marked statistic is 
{L^L2\r)/a^{Mf = {M^M2\r)/{Mf + 2{Mie2\r} /a{Mf + 
{eie2\r) /a^ (M)'^; the final two terms vanish only if the cor- 
relation is independent of environment.) 

Figure |H] shows that Lk oc Mstars (we have normalized 
each by their respective mean values across the population). 
Hence, the difference between the two curves in the left panel 
of Figure |5] suggests that the correlation between mass and 
light does depend on environment: close pairs tend to have 
larger mass-to-light ratios, suggesting that the mass-to- light 
ratio is larger in dense regions. 



4 THE EFFECTS OF RE-SCALING 

The semi-analytic models studied here do not match the 
observed distribution of galaxy properties precisely. For in- 
stance, these models do not match the SDSS luminosity 
function. On the other hand, they are reasonably success- 
ful at describing the luminosity dependence of clustering. 
For this reason, we envision that constraints on such mod- 
els will take the following form. If the models successfully 
describe the luminosity dependence of clustering, then they 
have successfully identified the halos in which galaxies form. 
If marked correlations are not changed in any essential way if 
the absolute values of the weights are altered without chang- 
ing the rank ordering, then marked statistics provide a way 
of determining if, in addition to correctly identifying the 
halos in which galaxies form, the models also determine the 
correct rank ordering of galaxy properties within halos. 

For such a test to work, we must understand the ef- 
fect that monotonic rescalings of the marks have on marked 
statistics. With this in mind, consider the following toy 
model. All galaxies come in either of two types of groups. 
Groups of one type contain mi galaxies, each having weight 
Wi, and the density run of galaxies around centres the 
group centers follows a Gaussian distribution with scale 
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Figure 7. Marked correlation function for the model described in 
the text. Note that the complex scale dependence of the statistic 
is the same for a range of values of wi , suggesting that although 
rescaling marks may change the statistic quantitatively, it does 
not result in qualitative changes. 



1/3 

length si oc . Galaxies in groups of the second type 
follow a Gaussian distribution with scale length S2, have 
weights W2, and there are 7712 oc S2 galaxies per group. 
Specifically, the density run around each type of group is 
p{r)/p = Ani exp(-rV2/s?)/^/2^, so m, = Anip27rs?- 

If / denotes the fraction of groups of type 1, then the 
mean weight is W" = [fmiWi + (1 — f)m2W2]/[fmi + (1 — 
f)m2]. Hence, the normalized weights are wi — Wi/W and 
W2 = W2/W = l + {l-wi)(si/s2f .f/{l-f). In this model, 

1 + W ^ 1 + kfwl e-'-'/^^i + fc(l - f){s2/sifw^2 e-'-'/^^i 
1+f l + fc/e-'-'/^^? +fc(l-,f)(s2/si)3e-'-'/4-i 

where k = [Ani/40F]/[/ + (1 - f){s2/sif]. Rescaling the 
marks corresponds to changing Wi; the fact that the marks 
are normalized then determines the rescaled value of VK2. 
Thus, given / and s\/s2, the dependence of the expres- 
sion above on wi gives the range of possible shapes that 
the marked statistic can possibly take. Figure |7| illustrates 
when KJi > 1. In this case, the marked statistic shows com- 
plex scale dependence, but this dependence is qualitatively 
similar for the different choices of wi. (We have not shown 
wi < 1; this case has (1 -I- W)/{1 + ^) greater than unity 
on all scales, with a maximum on intermediate scales. In 
this respect, the scale dependence is less complex than that 
shown in Figure |71 but again, it is qualitatively similar for 
all w < 1.) 

Figures |H| and |5| illustrate that the main features of the 
toy model are reproduced in the much more complex mark 
distributions studied in the main text. Figure |S] compares 
the rescaled distributions when B- and /("-band luminosity 
is the mark, and when the squares of the luminosities are 
the marks. Notice that although the distributions of normal- 
ized Lb and Lk are rather similar, the associated marked 
statistics shown in Figure|5|are very different. Note also that 
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Figure 8. Distribution of normalized marks, x/x, where x is the 
mean mark for x = Lg, L^, Lk and L-^. Notice that the distri- 
butions of normalized Lg and Lp^ are rather similar; nevertheless, 
the associated marked statistics shown in Figures|2]and|2lare very 
different. 



the normalized distributions of L% and L|c are substantially 
more skewed than those of L_g and Lk- 

Figure^lcompares the marked statistics when the mark 
is B, 7 or iiT-band luminosity (error bars), and when the 
mark is the square of the luminosity (filled squares). Note 
that both Ml and show similar scale dependence, in 

agreement with the toy model discussed above. In particular, 
note that the gross differences between the marked statistics 
when Lb and Lk are the marks are also obvious when Lb 
and L|(- are used as the marks. The jagged curves in the 
different panels show that M£ provides a good, but by no 
means perfect, approximation for . 

We remarked in Section r2.4l that monotonic rescaling of 
the marks may also be useful if one wishes to study marked 
statistics when the marks are not positive definite. A spe- 
cific example is galaxy color (although it happens that the 
distribution of F — 7 in the models we have studied in the 
previous section is positive definite, so that this was not an 
issue) . We have studied the effect of rescaling from V — I to 
Li/Lv = lO"'' *'^"^-', with a curious result. Because one of 
the marks is a monotonic (in this case exponential) trans- 
formation of the other, it is natural to wonder if My-i or 
^Li/Lv shows a clearer signal. One might reasonably expect 
that the distribution oi V — I will have smaller tails than 
that of Li/Lv- Since Mv-i increases on small scales, one 
might expect this increase to be even more pronounced for 
Mli/Ly ■ On the other hand, since the precision of the mea- 
surement depends on the variance of the mark, one might 
expect Ml^/l^ to be the noisier measurement. However, 
what matters is not the distribution of the mark itself, but 
that of the mark normalized by its mean value. It happens 
that, upon normalizing by the mean of the rescaled mark, 
the distributions of — 7 and Lj/Lv are almost indistin- 
guishable! Hence, the resulting marked statistics are also 
almost indistinguishable. 

It is possible to be more quantitative about this point. 
Figures |7| and |5| show that while the qualitative scale de- 
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pendence of (1 + W)/{1 + ^) does not depend on rescal- 
ing, it does matter quantitatively. This raises the question 
of whether or not rescaling is advisable: if rescaling pro- 
duces a marked statistic that differs more from unity than 
did the original mark, then perhaps rescaling is desirable. 
For instance, suppose one rescales to a distribution which 
has longer tails than the original [e.g., replace all marks w 
with exp('u;)]. If M(r) > 1 on small scales originally, then 
it is possible that after rescaling, M(r) is even larger than 
before. To address whether this represents a real improve- 
ment in signal, one must also consider the precision with 
which the rescaled marked statistic can be measured. Thus, 
the relevant question is whether or not this rescaling has 
also increased the statistical significance of the difference 
from unity. Appendix A provides a discussion of the preci- 
sion with which marked statistics can be estimated. Equa- 
tion 1A2II there shows that the ratio of the rms value of 
the marked statistic to its mean value depends on the ra- 
tio of {w\w2\r) to (wiiU2|f)^. Hence, rescaling is desirable if 
W'^W^ /[WW]^ for the rescaled distribution is smaller than 
for the original distribution, or if the difference between the 
WW/DD and unity, when expressed in units of the rms, 
[1 - DD/WW]/y'W^W'^/[WW]^ - 1/DD, is larger. 



5 DISCUSSION 

Weighting galaxies by different marks (luminosity, star for- 
mation rate, etc.) yields datasets which are each biased dif- 
ferently relative to the underlying dark matter distribution. 
Study of marked statistics allows one to address issues such 
as: Which marks, if any, result in a weighting of the galaxy 
distribution which minimizes the bias relative to the dark 
matter? Over what redshift range is this mark the least bi- 
ased? Cross correlations of marks also allow one to study, 
for example, if star formation rate. X-ray hardness or AGN 



activity are correlated with local density; if the color of a 
galaxy is correlated with the luminosity of its neighbour (the 
color of a galaxy is known to be strongly correlated with its 
own luminosity); and if luminosity dependent clustering is 
really due to morphological segregation. 

A discussion of the bias and variance of estimators of 
marked statistics is provided in Appendix^ We also illus- 
trated the use of marked statistics by showing how they be- 
have in semi-analytic galaxy formation models. The models 
predict that the luminosity-weighted corrrelation function 
should depend strongly on waveband (Figure |2|l: close pairs 
should be more luminous than more widely separated pairs 
if the luminosity is measured at long wavelengths such as 
the A'-band. If measured using the light from shorter wave- 
lengths such as the B-band, the trend should be reversed: 
close pairs should be less luminous. If the models are correct, 
then luminosity marked correlations from the HST ACS and 
2MASS should resemble the bottom left and right panels, 
respectively — they should be very different from each other. 

Marked statistics also show that, in the models, close 
pairs of galaxies (separations smaller than ~ 1 Mpc) have 
redder colors and smaller star formation rates than more 
widely separated pairs (Figure 2J, suggesting that objects 
in dense regions regions are redder and have smaller star 
formation rates. These trends are similar to the morphology- 
density (Dressier 1980) and star-formation rate-density re- 
lations (Lewis et al. 2002; Gomez et al. 2003). In addition, 
the stellar masses of objects in dense regions are larger, and 
the ratio of the typical stellar mass in dense regions to the 
average value is larger today than it was in the past (Fig- 
ure IKJ. 

We also showed that marked statistics provide simple 
tests of how correlations between observables depend on 
environment. As a specific example, the analysis of Sec- 
tion l2.6l when combined with the measurements of Mstars ('') 
and i\/LK (Figure 01 and the fact that Lk oc Mstars (Fig- 
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ure (HJ suggests that the mass-to-hght ratio is larger in 
dense regions. It will be interesting to make similar mea- 
surements of marked statistics to determine whether or not 
other well-known correlations between galaxy observables 
correlate with environment. For instance, to test if the Fun- 
damental Plane populated by early-type galaxy observables 
depends on environment, one would set m = size and w = 
the Fundamental Plane combination of surface brightness 
and velocity dispersion. 

The virtue of marked statistics for identifying and quan- 
tifying trends with environment is that they do not rely on 
a specific choice of scale on which to define the larger scale 
environment, nor do they require a subjective division of 
galaxy environments into cluster and field. Indeed, marked 
correlations are particularly well-suited to studying corre- 
lations between galaxy properties and their environment, 
when the correlations are weak (Sheth & Tormen 2004). Be- 
cause marked correlations do not pre-select a specific scale 
on which to study correlations with environment, they of- 
fer a powerful and complimentary approach to more tra- 
ditional techniques which do require specific definitions of 
environment (e.g. Kauffmann et al. 2004; Hogg et al. 2005). 
Quantifying such correlations is interesting because galaxy 
formation models make specific predictions for environmen- 
tal trends. This is because these models assume that galaxies 
form within dark matter halos, and the physical properties 
of galaxies are determined by the halos in which they form 
(White & Rees 1978; White & Frenk 1991). Therefore, the 
statistical properties of a given galaxy population are deter- 
mined by the properties of the parent halo population. 

The galaxy formation models of Kauffmann et al. (1999) 
and Cole et al. (2000) are based on this paradigm: they make 
rather similar assumptions for the complicated gastrophysics 
of galaxy formation. However, they make different assump- 
tions about the resulting correlations with environment. In 
particular, because Kauffmann et al. (1999) use halo merger 
history trees from the simulations, their models include cor- 
relations between halo formation histories and environment. 
In contrast. Cole et al. (2000) use Monte Carlo merger his- 
tories, which explicitly ignore any correlation between halo 
formation and environment. Marked correlation functions 
provide an efficient way of quantifying the differences which 
result. In this regard, Skibba, Sheth & Connolly (2006) show 
how the observed luminosity dependence of clustering, when 
combined with a measurement of the luminosity weighted 
correlation function allows a sensitive test of the environ- 
mental dependence of galaxy clustering. 

There are two main disadvantages associated with us- 
ing marked statistics. First, although considerable effort has 
been spent in developing accurate models of how the galaxy 
correlation function depends on scale and time (see Cooray 
& Sheth 2002 for a review), there has been little parallel 
development of how marked correlations are expected to de- 
pend on scale and time. Hence, interpretting the measure- 
ments is not straightforward. However, this is beginning to 
change: Sheth, Abbas & Skibba (2004) show how to describe 
the weighted correlation function using the halo-model of 
large-scale structure, when galaxies are associated with halo 
substructure, and Sheth (2005) shows how the halo model 
can be used to describe all the marked statistics described 
here. 

Second, we have assumed that all analysis of marked 
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Figure 10. Comparison of statistic when normalized stellar mass 
is the mark. Top curve with error bars shows the result when the 
stellar mass is that from the GIF galaxy formation simulation, 
whereas dashed curve shows the statistic after applying a mono- 
tonic rescaUng to the marks (shown as the inset in the upper right 
corner), so that they match the distribution of stellar masses re- 
ported by Cole et al. (2001). 



statistics will be performed on volume limited catalogs. 
While direct measurement (using the estimator discussed 
here) of marked statistics in magnitude limited surveys is 
straightforward, interpretation is more complicated. For in- 
stance, one might use the halo-model framework to model 
the measurement. Development of a measurement algorithm 
which accounts for the effects of the magnitude limit is the 
subject of work in progress. There is, in addition, the in- 
convenience associated with the fact that our analysis as- 
sumes that the marks are positive definite, but arguments 
in Section 2] suggest that monotonic rescaling to a distribu- 
tion which is positive definite will still yield useful results. 
Of course, all this assumes that the marks themselves have 
been accurately and precisely determined from the data. 

Marked statistics are sensitive to the distribution of 
marks in the dataset. Because it is likely that the semi- 
analytic models will not match the observed distribution 
of galaxy properties (e.g., the current generation of models 
does not match the SDSS luminosity function, and the dis- 
tribution of stellar masses does not match that infered from 
2MASS), we envision that constraints on models will take 
the following form. If the models successfully describe the 
luminosity dependence of clustering, then they have success- 
fully identified the halos in which galaxies form. Because the 
marked correlations are not changed in any essential way if 
the absolute values of the weights are altered without chang- 
ing the rank ordering (Section |^, marked statistics provide 
a way of determining if, in addition to correctly identifying 
the halos in which galaxies form, the models also determine 
the correct rank ordering of galaxy properties within halos, 
if not their absolute values. 

Figure 1101 illustrates this procedure. It shows (1 -f 
W)/{1 + 5) when stellar mass is the mark: the top curve 
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with error bars shows the measurement in the GIF sim- 
ulations (i.e., it is the same as in Figure]^, whereas the 
bottom dashed curve shows the result of rescaling the stel- 
lar masses so that they match the distribution reported by 
Cole et al. (2001) from their analysis of 2dF and 2MASS. 
To do this, we assumed that the marks in the GIF simu- 
lation are for the most massive galaxies in the simulation 
box. We then matched cumulative number densities, mean- 
ing that we worked our way down the GIF and Cole et al. 
stellar mass functions, assigning new masses to the semi- 
analytic galaxies until all 14665 objects had been assigned 
a 2dF-2MASS mass. Finally, we normalized these rescaled 
masses by their mean. The original and rescaled marks, nor- 
malized by their mean values, are shown in the inset in the 
top right corner of the Figure. This shows that, after rescal- 
ing, the marks span a smaller range of values around their 
mean value than they did originally. This has the effect of 
decreasing the amplitude of the rescaled marked statistic, 
but does not change the qualitative trend for close pairs to 
be more massive. More importantly, this illustrates that if 
the distribution of marks in the models does not match the 
data, then the marked statistic will also be discrepant. 

The measurements presented here serve as benchmarks 
against which to compare measurements from data such 
as the SDSS (Abazajian et al. 2005). Measurements in the 
SDSS database using color, velocity dispersion, effective ra- 
dius, morphology, star formation rate indicators, local den- 
sity, and so on, as marks are underway (Skibba, Sheth & 
Connolly 2006). Comparison with predictions such as those 
described above will constrain the galaxy formation models. 
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APPENDIX A: ERROR ESTIMATES 

In what follows, we provide error estimates of the marked 
statistic most studied in the main text, the ratio of the 
weighted and unweighted correlation functions: (H- W) /(1 + 
5). We do this in two steps; the first assumes the marks 
are perfectly measured, and the second includes the effects 
of measurement errors. In either case, the analysis assumes 
that the mean mark can be reliably determined from the 
data. Our analysis follows that of Peebles (1980), Mo, Jing & 
Borner (1992), Landy & Szalay (1993) and Bernstein (1994). 
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Al An unbiased estimator 

We estimate the statistic as WW/ DD, wlicrc WW is the 
sum over all pairs with separation r, weighting each mem- 
ber of the pair by its mark, and DD is the total number 
of such pairs. In what follows, we assume that the marks 
have all been normalized by the mean mark, so that DD 
really is the total number of pairs. (We discuss an alterna- 
tive procedure, in which DD is replaced by WsWs, which 
denotes the weighted pair counts in which the weights have 
been scrambled, shortly.) Let (W^H^) and {DD) denote the 
mean values of these statistics, where the mean is over an 
ensemble of realizations of the point process. The counts in 
any particular realization differ from this ensemble average 
value, so that 



ww \ _ {WW) n + Lj 

DD / ~^ {DD) \ l + 5 



(Al) 

The final expression follows from assuming 5 and keep- 
ing terms to second order. Now, (w) = and {5) = by defi- 
nition, so the average of the ratio (the left hand side) equals 
the ratio of the averages (the right hand side) if {u}5) = {5^). 
We can estimate these terms as follows. Let 

DD^ = Y,ninj Qij (r) = f/z (r ) ( A2) 



WW\ =J2^^iU>je)ij{r) = (wiW2\r\U2{r), (A3) 



and 



i<j 

where U2{r) is proportional to the total number of pairs, 
N{N — l)/2, times a geometrical factor, G2{r), which rep- 
resents the fraction of these pairs which have separation r, 
given the survey geometry (see, e.g., Landy & Szalay 1993). 
Then 



{DD ■ DD) 
{DD){DD) 



1 + 



UA{r) + U;{r) + U2{r) 



(A4) 



(72 (r) 2 

where U^ir) = N{N-l){N-2){N-3)/4 Gi(r), and Uair) = 
N{N - 1){N - 2)G3(r) are products of the total number 
of quadruples and triples times geometrical factors, defined 
similarly to U2{r). Similarly, 



{WW- DD) 
{WW){DD) 



= 1 + (uj5 



J2i<i ■w^^^'3 Q^jir) Efc<i ^kUi Qkiir) 



mean value, but had replaced DD by VK^VKs, then the es- 
timator would be unbiased provided we treated each of the 
Ws terms as being drawn from a different realization of the 
scrambled marks. This is because terms of the form (w^) in 
the expression for (WsVKs ■ WsWs) lead to a bias. Thus, the 
weights in the numerator of WW/WsWs must be treated dif- 
ferently from those in the denominator, making the notation 
somewhat confusing. This is why we prefer to use WW/DD 
(with the understanding that the weights have been normal- 
ized by their mean value), rather than WW/WgWe as our 
estimator. 



A2 Variance within a bin 

The variance of the estimator is 



Var 



WW 



DD 



/ U+^J \ / 
\(l-h<5)2/ \ 



{WW)^ /(l+oj) 
{DD)'^ 

(WW)^ 
{DD)^ 
(WW)^ 
{DD)^ 



/WW\2 
DD / 



(A7) 



where the final expression uses the fact that {ui6) = (5^). 
The sum in quadrature of the variances of WW and 

DD is proportional to {lu'^) + {5^) . Notice that tlio cxijrcssion 
above scales as the difference of these two terms rather than 
than the sum, so it can be substantially smaller. This reflects 
the fact that, if WW for a given realization of the point 
process is larger than the ensemble mean, it may simply be 
large because DD is also large (i.e., there were more pairs). 
As a result, fluctuations in WW and DD are correlated. 
The estimate of the true variance above accounts for this 
correlation. 

To see what it is, note that 



= 1 + 



(WW ■ WW) 
{WW) {WW) 
J2i<j WiWj Oijjr) J2k<i WkWi Qkiir) 

{wiW2\r)^ f/2(r)2 
U4{r) _^ {wfw2W3\r) Uajr) + {wfw^\r) U2{r) 



U^{r) (wiW2|r)2t72(r)2 
so the ratio of the variance to the square of the mean is 



(A8) 



V&r[WW/DD] 
{WW/DD)^ 



{wiW2W3\r) — {wiW2\r)^ Usir) 

WW 



{ruiru2\r)^ 

'•i\r) - {iriti-2 



'V" 



{wiW2\ry^ 



(72(r)2 



(A9) 



On large scales, where correlations tend to be weak, Ua w 
iVgai(ndV)2(l + and U2 « Ng^indV{l + 0/2 so 



{wiW2\r) ?/2(r)2 

(wiW2|r) 1/4(7-) -I- {wiW2\r) U-sir) + {wiW2\r) iM^-Zyi ~ 2(1 + Hence, the first term in the expres- 



(wiW2|r) U2{rY 



I + {5 



Thus, 



WW \ _ {WW) _ 
DD / ~ {DD) ' 



(A6) 



sion above scales as l/A'gai. The second term scales inversely 
with the number of pairs, so, for a sufficiently large survey, 
I^Jirst term dominates on large scales, whereas the second 
term dominates on small scales. 

The second term scales sensibly as the inverse of the 
pair counts, times a term which resembles a variance of mark 
pairs. It is useful to think of this term as 



this estimator is unbiased. 

Notice that if we had not normalized the marks by their 



{w'iw'ilr) 
(wiW2|r)2 



'U2{r) 



M„2(r) {wY 



Mg{r) {w)^ 



'U2{r); 
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here M^2 (r) denotes the statistic when the mark is the 
square of the weight, rather than the weight itself. Thus, 
this term can be estimated directly from the data as 
W^W^/IWW]^ - 1/DD. Notice that if M„2 = Ml, then 
this term can be written in terms of the variances of w and 
w'^ , so the only scale dependence comes from the factor of 
U2- Figure|n|in the main text shows that this approximation 
may not be wildly-off in astrophysical datasets. 

The first term is more problematic; its presence suggests 
that a better estimator than WW/ DD for our marked statis- 
tic can be found, just as a better estimator than DD/ RR 
can be found for the unweighted statistic 1+^. Writing down 
this alternative estimator is beyond the scope of the present 
paper. However, to see the effect of this extra term, suppose 
that there are no correlations, either between the points, or 
between the marks. This is not an unreasonable assump- 
tion on the large scales where the first term is expected to 
dominate. In this limit, 



Va.r[WW/DD] 
lyVW/DDY 



(it)) 2 



2 



(AlO) 



The form of this expression suggests that the variance should 
be well approximated by adding 



{w''\r) 



{wiW2\r) 



- 1 



2[l + e(r) 



gal 



(All) 



to W^W^/[WWf - 1/DD. FigureOlin the main text illus- 
trates how these two terms are expected to contribute to the 
error budget in typical astrophysical datasets. 

The analysis so far does not account for the possibility 
that, if the dataset is small, then the estimated pair counts 
are not from a fair sample, so they will be biased. This intro- 
duces additional terms into the error budget, which can be 
estimated by relatively straightforward but tedious exten- 
sion of the analysis in Landy & Szalay (1993). Also absent 
from this error budget is the fact that small datasets may 
not provide accurate measurements of the mean mark. Since 
our analysis assumes that all marks have been normalized 
by this mean measured in the dataset, this introduces an 
additional source of error. A more detailed account of these 
two additional sources of error will be presented elsewhere. 
Note that astrophysical datasets are now becoming suffi- 
ciently large that small sample issues are less likely to be a 
serious concern, so the estimates here should be reasonably 
accurate. 



U2{n)U2{r,)' 
1 + (w,Sj) = 1 + (5,5,), (A14) 



and we have set (ujiSi) — {5f) . Hence, the covariance depends 
on 

{D^D,}{D,D,)-^ + ^^'^'^-^ + rr r.-.rr r.^' (^13) 
{WM^}{D,D,) 

and 

{W^W^ ■ WjWj) 

{wm{w,w,) 

with 



1+ { WiW.. 



l + Vy3(n,r,)-iM!liIiI^, (A15) 



C/2(rO?72(r,)' 



{wiW2W3\ri,rj) 
{wiW2\ri) {wiwslrj) 



(A16) 



A4 Measurement errors 

The analysis above assumes the marks are perfectly mea- 
sured. If, instead, the marks are imprecisely measured, but, 
while imprecise, are unbiased, and if the measurement er- 
ror is not correlated with spatial position, then the previous 



expressions hold with {uj ) 



+ e ) where (e ) denotes 



the variance of the error on mark. Hence, the measurement 
errors do not bias the marked statistic, but they do increase 
Va,r[WW/DD], thus decreasing the precision of the measure- 
ment. Of course, if the measurement of the marks themselves 
is systematically biased, then the marked statistics will also 
be biased. 



A3 Covariance between bins 

In this case, we are interested in 



cov 



WiWi W2W2 



DiDi D2D2 



{WlWl){W2W2) 



(^SiS2 + UJILV2 



{DiDi)(D2D2) 

-L01S2 - uJ25i'^ (A12) 

where we have used the fact that 

WlWlW2W2\ {WlWl){W2W2) 



DiDi D2D2 



{DiDi){D2D2} 

— LJl<5l — LU252 + ljJ\ijJ2 — ijJ\&2 ~ ^2&\^ 



